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Abstract 

Dirty paper coding (DPC) allows a transmitter to send information to a receiver in the presence of 
interference that is known (non-causally) to the transmitter. The original version of DPC was derived for 
the case where the noise and the interference are statistically independent Gaussian random sequences. 
More recent works extended this approach to the case where the noise and the interference are mutually 
independent and at least one of them is Gaussian. In this letter we further extend the DPC scheme 
by relaxing the Gaussian and statistical independence assumptions. We provide lower bounds on the 
achievable data rates in a DPC setting for the case of possibly dependent noise, interference and input 
signals. Also, the interference and noise terms are allowed to have arbitrary probability distributions. 
The bounds are relatively simple, are phrased in terms of second-order statistics, and are tight when the 
actual noise distribution is close to Gaussian. 


I. Introduction 

The exponential growth of wireless data traffie on one hand and the limited availability of 
appropriate speetrum on the other, generates a strong motivation to significantly improve the 
spectral efficiency per unit area offered by wireless systems. One clear example is today’s 
cellular networks, where the overall data traffic grew by more than an order of magnitude 
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during just the past four years |[T1, O, and this trend is expected to continue and intensify. 
The conventional approach of deploying more base stations per unit area is no longer a valid 
cost-effective mechanism to supply the huge growth in traffic demand. Therefore, more advanced 
techniques such as cooperative transmission and interference mitigation at the transmitter side 
are being considered (e.g., Q) and standardized (e.g., (H) for the next generation of cellular 
networks. 

In 1983, Costa presented the dirty paper coding scheme (DPC) [|5l|. This scheme allows/ m// 
cancellation of interference at the transmitter side, and leads to interference free data rates. The 
DPC scheme attracted considerable interest in recent years, and state-of-the-art DPC approaches 
are applicable for the case where the noise and the interference terms are mutually independent 
of each other and of the input signal and at least one of these terms is Gaussian [0-[[8l|. In 
this work we further relax the model assumptions and consider the application of DPC to 
the case of non-Gaussian channels where the noise and interference terms may have arbitrary 
probability distributions, and may also be statistically dependent of each other and/or of the 
input signal. In addition to the obvious application of DPC for channels with non-Gaussian 
noise and interference, other common applications include: interference cancellation with partial 
channel state information (CSI) (e.g. dUl, ifTOl ). Uplink-Downlink capacity balancing via variable 
feedback rate ca, water marking (e.g., (fill, Ifia i. and more. 

We present a lower bound on the achievable rate in a DPC setting. The bound is based on 
choosing a Gaussian input distribution, and then proving that the effect of the interference can 
be completely eliminated, while the effect of the noise is bounded by the effect of a Gaussian 
noise with the same second order statistics. In the case of uncorrelated noise, the bounding data 
rate is identical to the capacity of the AWGN channel with the same noise variance (and without 
interference). This bound may also be viewed as an extension of the ‘Gaussian noise is the worst 
noise’ principle (e.g., ifTSll - lfTSll l to the case of DPC. Yet, the arguments that prove the worst 
noise principle do not hold in the DPC case, and a different proof is provided. Furthermore, one 
needs to recall that even in the absence of interference, the Gaussian noise is the worst noise 
only if the input distribution is Gaussian (as an example see ifT^ l. In the presence of interference 
the situation is even more intricate. 

The reminder of this paper is organized as follows: Section |n] presents the system model and 
briefly discuss the results of previous works. Section |nl] presents our main results and Section 
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HVl details their proofs. Seetion |V] eontains our eoneluding remarks. 

II. System model & Previous works 
We eonsider the additive noise additive interferenee ehannel, given by: 

Y = H-X + Z + N. (1) 

where X is the transmitted data symbol, H is a random ehannel gain and Z is an interferenee 
term with variance of cr|. It is assumed that the transmitter has non-casual knowledge of both 
the channel gain, H, and the interference term, Z, while the receiver knows only the channel 
gain. We consider a general additive noise, N, which may depend on the transmitted symbol, 
X, and the interference, Z. The channel is assumed to be memoryless, and obeys the probability 
lawQ: 

fH,z,N\xih, z, n\x) = fN\x,z{n\x, z)fz{z)fH{h) (2) 

where the distribution of the interference, fz{z), the distribution of the channel, fnih), and 
the conditional distribution of the noise, fN\x,zin\x,z), are given, while the transmitter can 
choose the conditional distribution of the signal given the known interference and the channel, 
to maximize the rate. 

We assume throughout an independent and identically distributed (iid) setting, i.e., H, Z and 
N are assumed to be composed of sequences of iid random variables (given X). Note that 
the results can be trivially extended to any power-limited ergodic distributions, by using an 
interleaving argument. 

A. Previous works 

The achievable rates in the presence of known interference of general distribution is charac¬ 
terized by the classical Gel’fand-Pinsker formula [fT^ : 

R = I{U;Y\H) - I{U-,Z\H) (3) 

’in this work we consider the case of real valued signals. The results hold also for the complex valued case with proper 
complex terms 1171 . but the factor 1/2 in Equations @ and 0 should be removed. 
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where U is an auxiliary random variable. This rate is achievable using random coding and 
binning with iid signaling, where 

fu,x,H,z,Niu, X, h, z, n) = fN\x,zin\x, z)fH{h)fziz)fx\u,zix\u, z, h)fjj\z{u\z, h). (4) 

But, the optimal setting of this auxiliary variable in general is yet unknown, and so is the 
corresponding maximal achievable rate. 

The DPC scheme gave the optimal choice of the auxiliary variable for the case of statistically 
independent noise, N, and interference, Z, both Gaussian processes |l5l. Later, it was extended 
to the case of Gaussian noise and arbitrary interference 0, [[8]|. Additionally, for the case of 
Gaussian interference Cohen and Lapidoth (bll showed that Gaussian noise gives a lower bound 
on the achievable rate for all noise distributions that are statistically independent of the input 
and the interference. 

Costa showed m that in the presence of additive Gaussian noise, the channel capacity is 
identical to the capacity of the channel without interference. Cohen and Lapidoth lfT9ll termed 
this property private-public equivalence (PPE) and showed that it does not hold for any noise 
distribution. They even conjectured that the PPE property holds only for additive Gaussian noise. 

To further emphasize the effect of non-Gaussian noise, Cohen and Zamir [|^ gave an example 
in which the difference between the channel capacity with and without interference is not even 
bounded. They further speculated that the DPC scheme is efficient only if the noise distribution 
is Gaussian. Yet, [l20l addressed only channels with discrete inputs, and hence, the distribution 
of the input, X, could not be Gaussian. In this work we take a different approach, and show 
that the capacity of the channel can be lower bounded when the input distribution is allowed to 
be Gaussian. 

The presented lower bound is equal to the no-interference capacity only if the noise and the 
input symbols are jointly Gaussian. Hence, the presented bound does not contradict the PPE 
conjecture of Cohen and Eapidoth lfT9l . 

III. Main results 

Defining the variance and correlation coefficient of general random variables, V and W as 

a^ = E [{V -E[V]f] and 

E[{y - E[V]) {W - E[W])] 


Pvw 


(JV • CTiy 


( 5 ) 
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respectively, a lower bound on the capacity of the additive noise additive interference channel 
is given by the following theorem. 

Theorem 1: Let Xq be a Gaussian random variable with zero mean and variance of aj^, and 
denote its distribution by fx^ix). Define also the random variable Nq, so that Nq, Z) follow 
the joint distribution: fxa,NG,z{x,n,z) = fN\x,z{n\x,z)fz{z)fxG{x)- 

The capacity of the channel presented in ([T]), subject to the average power constraint: < 

ax, is lower bounded by: 



( 6 ) 


Proof: The proof is given in Section |IVl 


The main importance of Theorem [T] is its ability to lower bound the capacity of an arbitrary 
additive channel, based only on second order statistics of the relevant signals. Furthermore, 
the bound does not depend on the interference variance and holds also for interference with 
unbounded variance. The theorem shows that correlation between the (known) interference and 
the noise is always beneficial and increases the lower bouncj^. On the other hand, correlation 
between the noise and the desired signal can be beneficial or harmful, depending on the signs 
of the cross-correlation coefficient and the channel gain. 

Note that the bound can be further tightened by adjusting the instantaneous transmission power 
according to the channel state. The optimization of the power allocation scheme is a variation 
of the well known water-pouring solution ETI . which is somewhat more complicated due to the 
possible dependency of the noise power and the correlation coefficients on the instantaneous 
transmission power. The extension for variable power allocation is straight-forward, and is 
therefore omitted. 

A simpler version of the bound is presented in Corollary [H where we apply Theorem [T] to 
the case that the X and N are not correlated. 

Corollary 1 (Uncorrelated noise): If, in addition to the conditions of Theorem [H the condi¬ 
tional distribution of the noise is such that E[N\X = x] = E[N] for any x, then the capacity 

^In the presence of correlation between the (known) interference and the (unknown) noise, the noise can be decomposed into 
two parts. The first part is known (given the interference) while the second part is unknown. The DPC scheme can mitigate the 
effect of the known part of the noise, and hence reduce the effect of the noise and increase the capacity 
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of the channel is lower bounded by: 

R > Eh 


llog. 


1 + 


H^-a 


X 


a 


No 


( 7 ) 


Proof: The corollary’s condition leads to E [X {N — E[N])] = 0 for any distribution of 
X, and in particular pxaNa = 0. The proof follows immediately by substituting this zero cross¬ 
correlation into dH) and noting that PzNq > 0 only increase the bound. ■ 

Note that © is also a lower bound on the capacity of the interference free channel. The bound 
is tight if the noise is Gaussian, and the interference is statistically independent of the noise. 

The condition in Corollary [T] includes also the simpler case in which the noise is statistically 
independent of the transmitted signal. In such case, the noise variance is independent of the 
input signal, and the term in © can be replaced by a%. If, in addition, the noise, N, and 
interference, Z, are statistically independent, and at least one of them is Gaussian, then the 
bound in © converge to the results of |l6l, ©, (H. 

Corollary [T] is particularly useful when © is close to the interference free channel capacity. 
In such cases, the corollary shows that DPC is useful even in imperfect channel settings, and 
gives theoretical basis for the analysis of DPC in such cases (e.g., in broadcast channels with 
imperfect CSI). 

The results above were derived using the following lemma, which is more general and allows 
the derivation of tighter bounds than ®. 

Lemma 1: The achievable rate in the channel of © is lower bounded by 


R > 


sup E 

fx\H,Z 


h{X\H 


ri, Z) - min h ((1 - I37])X + {a - (3)Z - pxlH = rj) 

0,0 


( 8 ) 


where h{») denotes the differential entropy and the distribution of N is evaluated according to 
the chosen distribution of X. If the problem has constraints on the input distribution, then any 
tested distribution of X must satisfy these constraints. 

Proof: See Section UVl ■ 

The lemma provides an easy way to derive expressions that lower bound the channel capacity. 
In particular, we note that any distribution of X and any choice of the parameters a and fi that 
satisfy the problem constraints will lead to a valid lower bound. A particularly interesting choice 
of parameters is given by a = /9, where the interference term in the right hand entropy in ® 
is zeroed. If in addition, the distribution of the input symbol is chosen so that X is statistically 
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independent of the known interferenee given H {fx\H,z = fx\H), then the evaluation of the 
bound ean be performed without any knowledge of the statistieal nature of the interferenee. 


IV. Proofs 

A. Proof of Lemma [7] 

Using the Gel’fand and Pinsker result, ([3]), an aehievable rate is given by: 

R = h{U\H) - h{U\H, Y) - h{U\H) + h{U\H, Z) 

= h{U\H, Z) - h{U\H, Y) = h{U\H, Z) - h{U - BY\H, Y) 
>h{U\H,Z)-h{U-BY\H) (9) 

where B ean be any funetion of the ehannel, H. To derive the lower bound, we further limit 
the diseussion to an auxiliary variable that is a linear eombination: 

U = X + AZ (10) 


where A may depend on the ehannel H. Thus we have: 


R > h{X\H, Z)-h({l- BH)X + (A - B)Z - BN 


H 


( 11 ) 


Optimizing with respeet to the input distribution and the ehoiee of the variables A and B, and 
reealling that the notation h{X\H) means Eri[h{X\H = rj)], leads to ([8]) and eompletes the proof 
of the lemma. 


B. Proof of Theorem [7] 

As we derive a lower bound, we ean limit the diseussion to any ehoiee of (eonditional) input 
distribution. We find it eonvenient to ehoose X to have the distribution of Xq, i.e., to have 
a Gaussian distribution with zero mean and a varianee aj^. Furthermore, X is assumed to be 
statistieally independent of Z and H. We write the resulting ehannel model as: 

Yg = H-Xg + Z + Ng. (12) 

We note also that pxqZ = E[Xg{Z — £'[Z])] = 0. 



Theorem [T] ean be proved directly by optimizing the constants a and /3 in ([8]). A shorter 
proof can be obtained by first removing the noise correlations. In order to account for the noise 
correlation, we rewrite (fT^ as: 


Yg = H -Xn + WZ + N 


(13) 


where: 


H — H + PxqNg 
W = 1 + Pzng 


(TX 

^Ng 

(TZ 


and 


N — Nq — PxgNg~^^g — PzNg~^Z. 

O'x CTZ 


(14) 

(15) 

(16) 


One can easily verify that: 


Xg{N-E[N]) =E {Z - E[z]) {N - E[N]) 


= 0 . 


Note that the resulting noise variance is: 


“ (l “ PXgNg ~ PzNg)^"^^ 


(17) 


'N ~ V" yXGNG HZNgJ^Ng- 

Using Lemma[I]on the virtual channel of (fT3l) . and recalling that Xq is statistically independent 
of Z and H, we have: 


R> E 


hiXo) - min (1 - ^p)Xg + {a - ^)WZ - ^N\H = p 


(18) 


Inspecting (fT^ . the differential entropy in the first line is the entropy of a Gaussian random 
variable, while the differential entropy in the second line can be upper bounded by the entropy 
of a Gaussian random variable with the same variance. Thus, the achievable rate satisfies: 


R > -log2(27recr|) - E 


loga {27reQ{H)) 


(19) 


where 


Q{p) = min Var f(l - ^p)Xg + {a - ^)WZ - 0) 

01,13 V / 


= niin(l - f^pYa'x + (a - 

Q:,p 


( 20 ) 














and the last line of (l20l) used the faet the Xq, Z and N are uneorrelated. To obtain the tightest 
bound, we need to minimize Q{r]) with respeet to a and (3. As the parameter a affeets only the 
middle term in the last line of (l20l) . the optimal value of a is obviously a = (3, leading to: 

Q(r/) = mm(l - I3vifa\ + /3VJ. (21) 

To find the optimal value of (3, we take the derivative of (|2T]) with respeet to [3 and equate to 0: 


-2r]{l - (3ri)a\ + 2(3a% = 0 


( 22 ) 


resulting with (3 = rja\/{'r]^a\ + a^^). 

Substituting baek in (|2T]) gives the minimal value of: 

Q(^) = = 

Substituting (|2^ baek in (fT9l) results with 

T 


N ^ 


Tfa\ + ^ 


N 


R> E 


log2 1 + 


h (Jx 


a 


N 


(23) 


(24) 


To eomplete the proof we need to substitute (fT4l) and (fTTI) . resulting with ®. ■ 

We note that an alternative proof, based on the multi-dimensional lattiee approaeh of Khina and 
Erez ll22l . is presented in [|23]l for the speeial ease where the noise is assumed to be uneorrelated 
with the input and the interferenee terms. This alternative proof has the advantage of being a 
eonstruetive proof, i.e., it presents a eoding seheme that ean aetually aehieve the bound, but it 
is more restrietive due to the uneorrelation assumption. 


V. CONCLUSIONS 

Lower bounds on the aehievable data rates in an additive noise additive interferenee ehannel 
are derived, for the ease where the interference is known to the transmitter. It is shown that 
dirty paper coding is applicable for this setup even when the noise and the interference are non- 
Gaussian and/or are mutually dependent on each other and on the input signal. Thus, the results 
indicate that a DPC standard precoder, designed for Gaussian noise and known interference, is 
robust in more general channel settings. The relaxed assumptions on the structure of the noise and 
interference signals facilitates the usage of the bounds for the analysis of actual communications 
systems. For these, both the known interference as well as the additive unknown noise may be 







10 


associated with structured signaling, such as transmissions of other users, and hence the Gaussian 
assumption may not fit. 

The bounds are phrased in terms of second order statistics of certain auxiliary variables, and 
are in the general form of the classical Shannon capacity formula for the interference free AWGN 
channel. The bounds are tight when the noise is close to Gaussian and can be interpreted as 
an extension of the well-known result that the ‘Gaussian noise is the worst noise’ to the more 
elaborate interference channel model considered in this letter. 
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